AITopics | Problem-Specific Architectures

Under this novel formulation, all policy optimization algorithms can be used off the shelf to learn intra-option policies, option termination conditions, and a master policy over options. We apply an actor-critic algorithm on each augmented MDP, yielding the Double Actor-Critic (DAC) architecture. Furthermore, we show that, when state-value functions are used as critics, one critic can be expressed in terms of the other, and hence only one critic is necessary. We conduct an empirical study on challenging robot simulation tasks. In a transfer learning setting, DAC outperforms both its hierarchy-free counterpart and previous gradient-based option learning algorithms.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: North America > Canada (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.49)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.34)

Add feedback

Supplementary Materials for MLP-Mixer: An all-MLP Architecture for Vision, Lucas Beyer

Neural Information Processing SystemsMar-21-2025, 21:02:38 GMT

A.1 Modifying the token-mixing MLPs We ablated a number of ideas trying to improve the token-mixing MLPs for Mixer models of various scales pre-trained on JFT-300M. Instead, we could introduce C separate MLPs with independent weights, effectively multiplying the number of parameters by C. We did not observe any noticeable improvements. Grouping the channels together Token-mixing MLPs take S-dimensional vectors as inputs. Every such vector contains values of a single feature across S different spatial locations. In other words, token-mixing MLPs operate by looking at only one channel at once.

artificial intelligence, image understanding, machine learning, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.77)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.77)

Add feedback

Ukrainians are looking past NATO to a European security architecture

Al JazeeraFeb-19-2025, 12:14:04 GMT

Cambridge, United Kingdom – The fate of Ukraine and the future of European security hangs in the balance as United States and Russian diplomats prepared to discuss an accelerated peace plan this week. The uncertainty and dreadful possibilities of this historical moment, with Russia occupying a fifth of Ukrainian soil, dominated the atmosphere of Firewalling the Future, a conference on the future of Ukraine held at Cambridge University on Monday. Organised by programme leader Victoria Vdovychenko and professor of Ukrainian studies Rory Finnin under the auspices of the Centre for Geopolitics, it brought together Ukrainian, European and British diplomats, soldiers and academics. Dominant among the Ukrainians and Eastern Europeans present was the sentiment that with Trump's re-election, the international order is irrecoverably lost and needs to be rebuilt. Some spoke openly of a post-NATO reality in which Europe must form new structures and alliances to fend for itself.

artificial intelligence, europe, ukraine, (16 more...)

Al Jazeera

Country:

Europe > Ukraine (1.00)
North America > United States (0.94)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.56)

Industry:

Government > Military (1.00)
Government > Regional Government > North America Government > United States Government (0.49)
Government > Regional Government > Europe Government (0.36)

Technology:

Information Technology > Security & Privacy (0.40)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.40)

Add feedback

Supplementary Materials for MLP-Mixer: An all-MLP Architecture for Vision, Lucas Beyer

Neural Information Processing SystemsFeb-10-2025, 18:38:51 GMT

A.1 Modifying the token-mixing MLPs We ablated a number of ideas trying to improve the token-mixing MLPs for Mixer models of various scales pre-trained on JFT-300M. Instead, we could introduce C separate MLPs with independent weights, effectively multiplying the number of parameters by C. We did not observe any noticeable improvements. Grouping the channels together Token-mixing MLPs take S-dimensional vectors as inputs. Every such vector contains values of a single feature across S different spatial locations. In other words, token-mixing MLPs operate by looking at only one channel at once.

artificial intelligence, image understanding, machine learning, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.77)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.77)

Add feedback

A Survey on Mamba Architecture for Vision Applications

Ibrahim, Fady, Liu, Guangjun, Wang, Guanghui

arXiv.org Artificial IntelligenceFeb-10-2025

Transformers have become foundational for visual tasks such as object detection, semantic segmentation, and video understanding, but their quadratic complexity in attention mechanisms presents scalability challenges. To address these limitations, the Mamba architecture utilizes state-space models (SSMs) for linear scalability, efficient processing, and improved contextual awareness. This paper investigates Mamba architecture for visual domain applications and its recent advancements, including Vision Mamba (ViM) and VideoMamba, which introduce bidirectional scanning, selective scanning mechanisms, and spatiotemporal processing to enhance image and video understanding. Architectural innovations like position embeddings, cross-scan modules, and hierarchical designs further optimize the Mamba framework for global and local feature extraction. These advancements position Mamba as a promising architecture in computer vision research and applications.

artificial intelligence, arxiv preprint arxiv, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2502.07161

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision > Image Understanding (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.68)

Add feedback

Reviews: DAC: The Double Actor-Critic Architecture for Learning Options

Neural Information Processing SystemsJan-23-2025, 14:14:55 GMT

Post-rebuttal update: I have read the rebuttal. Thanks for the clarification regarding they type of experiments where there is a larger gap between DAC and the baselines, as well as the clarification on PPO OC/IOPG. The paper proposes a new method for learning options in a hierarchical reinforcement learning set-up. The method works by decomposing the original problem into two MDPs, that can each be solved using conventional policy-based methods. This allows new state-of-the-art methods to easily be'dropped in' to improve HRL.

artificial intelligence, double actor-critic architecture, machine learning, (8 more...)

Neural Information Processing Systems

Genre: Research Report > Promising Solution (0.37)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.87)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.40)

Add feedback

DAC: The Double Actor-Critic Architecture for Learning Options

Shangtong Zhang, Shimon Whiteson

Neural Information Processing SystemsJan-23-2025, 14:14:54 GMT

Under this novel formulation, all policy optimization algorithms can be used off the shelf to learn intra-option policies, option termination conditions, and a master policy over options. We apply an actor-critic algorithm on each augmented MDP, yielding the Double Actor-Critic (DAC) architecture. Furthermore, we show that, when state-value functions are used as critics, one critic can be expressed in terms of the other, and hence only one critic is necessary. We conduct an empirical study on challenging robot simulation tasks. In a transfer learning setting, DAC outperforms both its hierarchy-free counterpart and previous gradient-based option learning algorithms.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: North America > Canada (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.49)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.34)

Add feedback

Reviews: DAC: The Double Actor-Critic Architecture for Learning Options

Neural Information Processing SystemsJan-23-2025, 14:14:44 GMT

The paper introduces a double actor critic architecture for learning options. The authors define 2 augmented MDPs for learning the option selection policy as well as the options themselves. Using this MDP formulation, off-the-shelf policy learning algorithms can be used for learning option selection as well as option policies, which was not possible with previous algorithms. The reviews for this paper are borderline. Most reviewers appreciated the intutive idea and the promising results reported in the paper.

artificial intelligence, double actor-critic architecture, machine learning, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.64)

Add feedback

BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons

Neural Information Processing SystemsOct-10-2024, 06:44:40 GMT

This paper studies the problem of designing compact binary architectures for vision multi-layer perceptrons (MLPs). We provide extensive analysis on the difficulty of binarizing vision MLPs and find that previous binarization methods perform poorly due to limited capacity of binary MLPs. In contrast with the traditional CNNs that utilizing convolutional operations with large kernel size, fully-connected (FC) layers in MLPs can be treated as convolutional layers with kernel size 1\times1 . Thus, the representation ability of the FC layers will be limited when being binarized, and places restrictions on the capability of spatial mixing and channel mixing on the intermediate features. To this end, we propose to improve the performance of binary MLP (BiMLP) model by enriching the representation ability of binary FC layers. We design a novel binary block that contains multiple branches to merge a series of outputs from the same stage, and also a universal shortcut connection that encourages the information flow from the previous stage.

artificial intelligence, machine learning, vision multi-layer perceptron, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision > Image Understanding (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (1.00)

Add feedback

DAC: The Double Actor-Critic Architecture for Learning Options

Neural Information Processing SystemsOct-10-2024, 00:21:43 GMT

Under this novel formulation, all policy optimization algorithms can be used off the shelf to learn intra-option policies, option termination conditions, and a master policy over options. We apply an actor-critic algorithm on each augmented MDP, yielding the Double Actor-Critic (DAC) architecture. Furthermore, we show that, when state-value functions are used as critics, one critic can be expressed in terms of the other, and hence only one critic is necessary. We conduct an empirical study on challenging robot simulation tasks. In a transfer learning setting, DAC outperforms both its hierarchy-free counterpart and previous gradient-based option learning algorithms.

artificial intelligence, double actor-critic architecture, machine learning, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Problem-Specific Architectures (0.40)

Add feedback

Filters

Collaborating Authors

Problem-Specific Architectures

DAC: The Double Actor-Critic Architecture for Learning Options

Supplementary Materials for MLP-Mixer: An all-MLP Architecture for Vision, Lucas Beyer

Ukrainians are looking past NATO to a European security architecture

Supplementary Materials for MLP-Mixer: An all-MLP Architecture for Vision, Lucas Beyer

A Survey on Mamba Architecture for Vision Applications

Reviews: DAC: The Double Actor-Critic Architecture for Learning Options

DAC: The Double Actor-Critic Architecture for Learning Options

Reviews: DAC: The Double Actor-Critic Architecture for Learning Options

BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons

DAC: The Double Actor-Critic Architecture for Learning Options